Shakespearizing Modern Language Using Copy-Enriched Sequence-to-Sequence Models
Authors
Abstract
Variations in writing style are commonly used to adapt content to a specific context, audience, or purpose. However, applying stylistic variations is still largely a manual process, and there has been little effort towards automating it. In this paper, we explore automated methods to transform text from modern English to Shakespearean English using an end-to-end trainable neural model with pointers to enable copy actions. To tackle the limited amount of parallel data, we pre-train word embeddings by leveraging external dictionaries that map Shakespearean words to modern English words, as well as additional text. Our methods achieve a BLEU score of 31+, an improvement of ≈ 6 points over the strongest baseline. We publicly release our code to foster further research in this area.
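The copy action mentioned in the abstract can be illustrated with a small sketch. The abstract does not give the exact formulation, so the mixing rule below is an assumption: it blends a decoder's vocabulary distribution with an attention-based pointer distribution over the source tokens, weighted by a generation probability. The function name `mix_copy_distribution`, the parameter `p_gen`, and the toy numbers are all hypothetical and not taken from the paper or its released code.

```python
from collections import defaultdict


def mix_copy_distribution(p_vocab, attention, source_tokens, p_gen):
    """Blend a generation distribution with a copy (pointer) distribution.

    p_vocab: dict mapping token -> probability from the decoder softmax
    attention: list of weights over source positions (assumed to sum to 1)
    source_tokens: tokens of the source sentence, same length as attention
    p_gen: probability in [0, 1] of generating instead of copying (assumed)
    """
    if len(attention) != len(source_tokens):
        raise ValueError("attention and source_tokens must have equal length")
    final = defaultdict(float)
    # Generation part: scale the vocabulary distribution by p_gen.
    for token, prob in p_vocab.items():
        final[token] += p_gen * prob
    # Copy part: each source token receives its attention mass,
    # scaled by the probability of copying.
    for token, weight in zip(source_tokens, attention):
        final[token] += (1.0 - p_gen) * weight
    return dict(final)


# Toy example: the name "Romeo" is absent from the output vocabulary,
# so it can only be produced by copying it from the source sentence.
source = ["good", "morning", "Romeo"]
p_vocab = {"good": 0.5, "morrow": 0.3, "thou": 0.2}
attention = [0.1, 0.1, 0.8]
dist = mix_copy_distribution(p_vocab, attention, source, p_gen=0.5)
print(max(dist, key=dist.get))
```

In this toy setting the copied token wins because most of the attention mass sits on it. A trained model would typically predict `p_gen` and the attention weights at each decoding step rather than take them as fixed inputs.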
Similar resources
Comparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species
Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to the D genome, which is an important wild bio-source for cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information is a versatile tool for species identification and phylogenetic implications in plants. Different ch...
The vlhA gene sequencing of Iranian Mycoplasma synoviae isolates
The variable lipoprotein haemagglutinin (VlhA) expressed by Mycoplasma synoviae is believed to play a major role in the pathogenesis of the disease by mediating adherence and immune evasion. The aim of this study was to sequence Iranian M. synoviae isolates to detect nucleotide variation in the M. synoviae vlhA gene. Using oligonucleotide primers complementary to the single-copy conserved 5´ end...
Comparative genomics of human stem cell factor (SCF)
Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...
Ectopic Expression of Embryo/Cancer Sequence A (ECSA) in KYSE-30 Cell Line Using Retroviral System
Background: Human preimplantation embryonic cells share many similarities with cancer cells, such as the ability to self-renew, unlimited proliferation, and maintenance of the undifferentiated state. Embryo-cancer sequence A (ECSA), also known as developmental pluripotency associated-2 (DPPA2), is a cancer testis antigen (CTA) whose biological function is still unclear. Objective: CTAs are expressed normal...
Determination of aspartic protease gene dosage in the Onchocerca volvulus genome
Aspartic proteases are a relatively small group of enzymes that are expressed in various nematodes, including Onchocerca volvulus. An estimation of the gene copy number corresponding to the OV7A clone, which contains a cDNA insert encoding approximately two-thirds of the entire coding sequence of aspartic protease of O. volvulus, was made by slot blot analysis in a closely related species O. gibsonig...
Journal title:
- CoRR
Volume abs/1707.01161, issue -
Pages -
Publication date 2017